Learning Inter-Task Transferability in the Absence of Target Task Samples

نویسندگان

  • Jivko Sinapov
  • Sanmit Narvekar
  • Matteo Leonetti
  • Peter Stone
چکیده

In a reinforcement learning setting, the goal of transfer learning is to improve performance on a target task by re-using knowledge from one or more source tasks. A key problem in transfer learning is how to choose appropriate source tasks for a given target task. Current approaches typically require that the agent has some experience in the target domain, or that the target task is specified by a model (e.g., a Markov Decision Process) with known parameters. To address these limitations, this paper proposes a framework for selecting source tasks in the absence of a known model or target task samples. Instead, our approach uses meta-data (e.g., attribute-value pairs) associated with each task to learn the expected benefit of transfer given a source-target task pair. To test the method, we conducted a large-scale experiment in the Ms. Pac-Man domain in which an agent played over 170 million games spanning 192 variations of the task. The agent used vast amounts of experience about transfer learning in the domain to model the benefit (or detriment) of transferring knowledge from one task to another. Subsequently, the agent successfully selected appropriate source tasks for previously unseen target tasks.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Investigation of Spoken Output and Intervention Types among Iranian EFL Learners

This study was inspired by VanPatten and Uludag’s (2011) study on the transferability of training via processing instruction to output tasks and Mori’s (2002) work on the development of talk-in-interaction during a group task. An interview was devised as the pretest, posttest, and delayed posttest to compare four intervention types for teaching the simple past passive: traditional intervention ...

متن کامل

The Comparative Effect of Task Type and Learning Conditions on the Achievement of Specific Target Forms

The completion mode (individual, collaborative) of the tasks and the conditions under which these modes are performed have been reported to play an important role in language learning. The present study aimed to investigate the effects of employing text editing tasks performed both individually and collaboratively, on the achievement of English grammar under explicit and implicit learning condi...

متن کامل

The Effect of Written Corrective Feedback on the Accuracy of Output Task and Learning of Target Form

The effect of error feedback on the accuracy of output task types such as editing task, text reconstruction task, picture cued writing task, and dictogloss task, has not been clearly explored. Following arguments concerning that the combination of both corrective feedback and output makes it difficult to determine whether their effects were in combination or alone, the purpose of the present st...

متن کامل

Task-Induced Involvement in L2 Vocabulary Learning: A Case for Listening Comprehension

The study aimed at investigating whether the retention of vocabulary acquired incidentally is dependent upon the amount of task-induced involvement. Immediate and delayed retention of twenty unfamiliar words was examined in three learning tasks( listening comprehension + group discussion, listening comprehension + dictionary checking + summary writing in L1, and listening comprehension + dictio...

متن کامل

Investigating the relationship between dimensions of occupational, personal, support and task factors with professional learning activities of teachers

The present study aimed to investigate the Relation of professional, personal, support, and task factors dimensions on professional learning activities of male secondary school teachers of Urmia Education Department (District 1) during the academic year 2017-2018. This applied and descriptive-survey research study in terms of its aim and method. The target population included 336 teachers out o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015